OPTICAL DISC, RECORDING DEVICE 
AND REPRODUCING DEVICE 



BACKGROUND OF THE INVENTION 

5 1. Field of the Invention 

The present invention relates to a readable and writable 
optical disc, and to a recording device and a reproducing device for the 
optical disk. More particularly, our invention relates to an optical disc for 
recording multimedia data including moving picture data, still image data, 
10 and audio data, and to a recording device and a reproducing device for 
this optical disc, 

2. Description of the Related Art 

Rewritable optical discs have for years had a maximum 

15 Storage capacity of approximately 650 MB. but this has been changed by 
the development of phase change type DVD-RAM discs with a capacity of 
several gigabytes. Combined with the adoption of MPEG, and particularly 
MPEG-2, standards for encoding digital AV data. DVD-RAM is widely 
anticipated as a recording and reproducing medium with application In the 

20 AV industry as well as the computer industry. More specifically, DVD-RAM 
media are expected to replace magnetic tape as the storage medium of 
choice for AV recordings. 

A. DVD-RAM 

25 Increases in the storage density of rewritable optical disc 

media over the last few years has made it possible to use such media for 
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applications ranging from storing computer data and recording audio data 

to recording image data, including movies. 

The signal recording surface of a conventional optical disc is 

typically formatted with lands and grooves, one of which is used as a 
5 guide groove for signal recording and reproducing. The data signal is then 

recorded using only the land or the groove. With the advent of the land 

and groove recording method, however, it became possible to record 

signals to both the land and groove. This development approximately 

doubled the storage capacity of the disc. 
10 Further development of a zone CLV (constant linear velocity) 

method simplified and made it easy to implement a CLV recording and 

reproducing technique, an effective means of further increasing the 

recording density. 

A major topic left for future development is how to use such 
15 potentially high capacity optical disc media to record AV data containing 

image data to achieve new functions and performance far surpassing 

conventional AV products. 

With the introduction of high capacity rewritable optical disc 

media, optical discs are widely expected to replace conventional tape 
20 media for recording and reproducing AV content. The transition from tape 

to disc recording media is also expected to greatly affect both the 

performance and functions of AV recording and reproducing products. 

One of the greatest benefits of a transition to disc is a 

significant improvement in random access performance. While random 
25 access to tape content is possible, it generally takes on the order of 

minutes to rewind a full tape. This is several orders slower than the 

typical seek time of optical disc media, which is on the order of at most 
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several ten milliseconds. Tape is therefore considered, for practical 
purposes, not to be a random access medium. 

The random access capability of optical disc media has also 
made possible distributed, that is, noncontiguous, recording of AV data, 
5 which is not possible with conventional tape. 

Fig. 34 is a block diagram of the drive device of a DVD 
recorder. As shown in Fig. 34. this DVD recorder comprises an optical 
pickup 11 for reading data from the disc 10, an ECC (error correction 
code) processor 12, track buffer 13, switch 14 for changing track buffer 
10 input/output^ encoder 15, and decoder 16. An enlarged view of the disc 17 
format is also shown. 

As indicated by the disc 17 format, the smallest unit used for 
recording data to a DVD-RAM disc is the sector, which is 2 KB. Sixteen 
sectors are combined as one ECC block, to which the ECC processor 12 
15 applies error correction coding. 

The track buffer 13 is used for recording AV data at a variable 
bit rate in order to record AV data to a DVD-RAM disc more efficiently. 
While the read/write rate (Va) to a DVD-RAM disc is fixed, the bit rate 
(Vb) of the AV data is variable, based on the complexity of the AV data 
20 content (e.g., images if the AV data is video). The track buffer 13 is used 
to absorb this bit rate difference. This means that the track buffer 13 is 
unnecessary if the AV data bit rate is also fixed, as it is in the Video CD 
format. 

This track buffer 13 can be even more effectively used by 
25 dispersed placement of the AV data on the disc. This is explained with 
reference to Fig. 35. 
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Fig. 35 (a) shows the disc address space. If the AV data is 
recorded divided between contiguous area A1 between addresses a1 and 
a2, and contiguous area A2 between a3 and a4 as shown in Fig. 35 (a), 
the AV data can be continuously reproduced from these non-contiguous 
areas A1 and A2 by supplying data accumulated in the track buffer 13 to 
the decoder while the optical head seeks from a2 to a3. This is shown in 
Fig. 35 (b). 

Once reading AV data starts from a1 at time t1, it is both input 
to the track buffer 13 and output from the track buffer 13 with data 
accumulating in the track buffer at the rate (Va-Vb), that is, the difference 
between the input rate Va to the track buffer and the output rate Vb from 
the track buffer. This continues to address a2 at time t2. Assuming that 
the data volume accumulated to the track buffer at this time is B(t2), data 
supply to the decoder can continue until the data B(t2) accumulated to the 
track buffer is depleted at time t3 at which reading resumes from address 
a3. 

In other words, if it is assured that a certain volume of data 
([a1, a21) is read before a seek operation is performed, AV data can be 
continuously supplied to the decoder while the seek is in progress. 

It should be noted that this example considers reading, that is. 
reproducing, data from DVD-RAM, but the same concept applies for 
writing or recording data to DVD-RAM. 

It will thus be obvious that insofar as a specified amount of 
data is recorded continuously to DVD-RAM disc, continuous reproduction 
and recording is possible even if the AV data is noncontiguously recorded 
to the disc. 
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B. MPEG 

A common AV data format is described next below. 
As noted above, AV data is recorded to DVD-RAM media 
using the MPEG international standard, also known as ISO/IEC 13818. 

Even though DVD-RAM discs have a large, plural gigabyte, 
capacity, this is still not sufficient for recording uncompressed digital AV 
data of any duration. A way to compress and record AV data is therefore 
necessary. This need was addressed by worldwide adoption of the MPEG 
(ISO/IEC 13818) standard for AV data compression. MPEG decoders 
(compression/decompression ICs) have also been realized with advances 
in IC devices. This has enabled the DVD recorder to handle MPEG 
compression and decompression internally. 

MPEG signal processing Is able to achieve high efficiency 
data compression chiefly as a result of the following two features. 

First is that compression using a time correlation 
characteristic between frames (known as pictures in MPEG) is used in 
conjunction with conventional compression using a spatial frequency 
characteristic for moving picture data compression. Each video sequence 
of an MPEG video signal stream is divided into one or more groups of 
pictures, each group of pictures comprising one or more pictures of three 
different types: l-pictures (intraframe coded pictures), P-pictures 
(predictive-coded pictures, that Is, intracoded with reference to a 
preceding picture), and B-pictures (bidirectionally predictive-coded 
pictures, that is, intraframe coded with reference to preceding and 
following pictures). 

Fig. 36 shows the relationship between I, P, and B pictures. 
As shown in Fig. 36, P-pictures refer to temporally preceding I- or P- 



pictures in the sequence, while B^pictures refer to the first preceding and 
following I- or P-pictures. it should also be noted that because B-pictures 
reference an upcoming I- or P-picture, the display order of the pictures 
may not match the coding order of the pictures in the compressed data 
5 bitstream. 

The second feature of MPEG coding is that code size is 
dynamically allocated by picture unit according to the complexity of the 
image. An MPEG decoder has an input buffer, and by accumulating data 
in this decoder buffer a large amount of code can be allocated to complex 

10 images that are difficult to compress. 

Three types of audio coding are used for the audio portion of 
a DVD-RAM recording: MPEG audio with data .compression, Dolby 
Digital^^^ (also known as AC-3). and noncompressive linear pulse code 
modulation (LPCM). Both Dolby Digital<^> and LPCM are fixed bit rate 

15 coding methods, but MPEG audio coding can select from several 
compression rates on an audio frame basis, although audio compression 
is not as high as video stream compression. 

The resulting compressed video and audio streams are 
multiplexed to a single stream using a method known as the MPEG 

20 system. Fig. 37 shows the organization of an MPEG system stream. As 
shown in Fig. 37, each 2 KB sector comprises a pack header 41, packet 
header 42. and payload 43. The MPEG system thus has a hierarchical 
structure comprising packs and packets. Each packet comprises a packet 
header 42 and payload 43. AV data is segmented from the beginning into 

25 blocks of an appropriate size for storage to the payload 43. 

The packet header 42 records information referring to the AV 
data stored in the associated payload 43. More specifically, the packet 
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header 42 contains a stream ID for identifying the data stored in the 
associated packet, and a decoding time stamp (DTS) and presentation 
time stamp (PTS) identifying the decoding time and presentation time of 
the data contained in the payload in 90 kHz precision. If the decoding and 
5 presentation are simultaneous, as in the case of audio data, the DTS can 
be omitted. 

A pack is a unit of plural packets. In DVD-RAM, however, 
there is one pack for each packet, and each pack therefore comprises a 
pack header 41 and packet (containing a packet header 42 and payload 
10 43). 

The pack header contains a system clock reference (SCR) 
expressing with 27 MHz precision the time at which the data contained in 
this pack is input to the decoder buffer. 

An MPEG system stream thus comprised is recorded one pack 

15 to a sector (= 2048 bytes) on DVD-RAM. 

A decoder for decoding the above-noted MPEG system stream 
is described next below. Fig. 38 is a block diagram of an exemplary 
decoder model (P_STD) of an MPEG system stream decoder. Shown in 
Fig. 38 are the system time clock (STC) 51, that is, the internal reference 

20 clock for decoder operation; a demultiplexer 52 for decoding 
(demultiplexing) the system stream; video decoder input buffer (video 
buffer) 53; video decoder 54; re-ordering buffer 55 for temporarily storing 
I and P pictures to absorb the difference in the coding (data) sequence 
and presentation sequence that occurs between B pictures and I and P 

25 pictures; a switch 56 for adjusting the output order of the I, P, and B 
pictures buffered to the re-ordering buffer 55; an audio decoder input 
buffer (audio buffer) 57; and audio decoder 58. 
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This MPEG system decoder processes the above-noted MPEG 
system stream as follows. 

When the time indicated by the STC 51 and the SCR written 
to the pack header match, the pack is input to the demultiplexer 52. The 

5 demultiplexer 52 then interprets the stream ID in the packet header, and 
passes the audio stream and video stream contained in the payload data 
to the appropriate decoder buffers. The PTS and DTS are also read from 
the packet header. 

When the times indicated by the STC 51 and DTS match, the 

10 video decoder 54 reads and decodes the picture data from the video 
buffer 63. I and P pictures are stored to the re-ordering buffer 55 while B 
pictures are presented directly to screen. If the picture being decoded by 
the video decoder 54 is an I or P picture, the switch 56 switches to the re- 
ordering buffer 55 to output the previous I or P picture from the re- 

15 ordering buffer 55; if a B picture is decoded, the switch 56 switches to the 
video decoder 54. 

Similarly to the video decoder 54, the audio decoder 58 reads 
and decodes one audio frame of data from the audio buffer 57 when the 
PTS matches the STC 51 (a DTS is not recorded for audio data). 

20 An exemplary method of multiplexing an MPEG system stream 

is described next with reference to Fig. 39. Note that a sequence of video 
frames is shown in Fig. 39 (a), the change in data storage to the video 
buffer is shown in Fig. 39 (b). a typical MPEG system stream is shown in 
Fig. 39 (c), and an audio signal is shown in Fig. 39 (d). Each of Figs. 39 

25 (a) to (d) are shown on a common time base (horizontal axis). The 
vertical axis in Fig. 39 (b) indicates the amount of data stored to the video 
buffer. The bold line in this graph thus indicates the change over time in 
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the buffered video data volume. The slope of this line is indicative of the 
video bit rate, and shows that data is input to the video buffer at a 
constant rate. The decrease in buffered data at regular intervals indicates 
the progression of data decoding. The intersection of the dotted line 
5 extension of the graphed line with the time base (horizontal axis) 
indicates the time at which video frame transfer to the video buffer begins. 

MPEG encoding is described next using by way of example 
coding a complex image A in the video data stream. As shown in Fig. 39 
(b). image A requires a large coding block, and data transfer to the video 

1,0 buffer must therefore begin from a time t1 before the image A decoding 
time. Note that the time from data input start time t1 to decoding is 
referred to as vbv_delay below. AV data is thus multiplexed to the 
position (time) of the shaded video pack. 

Unlike video data, audio data does not require dynamic coding 

15 size control. It is therefore not necessary for audio data transfer to start 
at a similarly advanced time before decoding starts, and audio data is 
thus typically multiplexed only slightly before decoding starts. Video data 
is thus multiplexed to the MPEG system stream before the audio data. 

It should be further noted that data can be accumulated to the 

20 buffer for a limited time in the MPEG system. More specifically, the MPEG 
system standard requires all data other than still image data be output to 
the decoder from the buffer within one second of being stored to the 
buffer. This means that there is at most a one second offset between 
video data and audio data multiplexing (or more precisely, the time 

25 required for video frame reordering). 

It will also be obvious that while the MPEG system stream is 
described above with video data preceding the audio, the audio can 
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theoretically precede the video. This type of stream can be purposely 
generated by using for the video data simple images to which a high 
compression rate can be applied, and transferring the audio data earlier 
than required. Even in this case, however, the audio can precede the 
video by at most one second due to the restrictions imposed by the MPEG 
standard. 

AudiQ gtreqm format 9nd rgprodupt io n 

The format of the audio stream and a method for audio stream 
reproduction are described next below. 

As described above, data is recorded and reproduced from a 
linear recording area in sequential access media such as magnetic tape. 
A typical method for recording an audio stream to a plurality of tracks on 
a single tape is described next below with reference to Fig. 41. In this 
example a maximum of two audio streams, shown as audio stream 1 and 
audio stream 2, can be recorded for a single video stream. In this 
example audio stream 1 is a single audio channel, generally known as a 
monaural audio channel, and audio stream 2 comprises two audio 
channels, such as a stereo audio signal or two monaural streams enabling 
a bilingual recording. It is also possible to record only one of these two 
audio streams (audio stream 1 or audio stream 2) or to record no audio 
stream. However, reducing the amount of audio recorded cannot be used 
as a means for increasing the video storage capacity of the tape. In other 
words, the audio stream recording area, i.e., the audio track space, is 
reserved exclusively for audio content and cannot be used for any other 
application even when no audio stream is actually recorded. The user can 
also select which of the two audio streams and channels to play, and the 
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audio stream or channel selected by the user is reproduced 
simultaneously with the video. 

DVD-RAM and other disc media, however, allow for more 
flexible audio stream recording and reproduction. The number of audio 
5 streams and channels recorded simultaneously with a video stream can 
be varied for the plurality of audio streams recorded to a disc. 

Fig. 42 shows some of the ways in which the audio stream 
content can be varied with the video stream in a disc media. For example, 
AV stream 1 in Fig. 42 (a) comprises one audio stream for the video 

10 stream, and the audio stream in this case has only one channel. 

AV stream 2 in Fig. 42 (b) similarly comprises one audio 
stream for the same video stream, but the audio stream in this case 
comprises two channels, i.e., main and sub audio channels. In this case 
the audio stream contains two selectively reproducible audio channels, a 

15 first audio channel containing the main audio (such as a first language), 
and a second audio channel containing the auxiliary audio data of the sub 
channel (such as a second language). 

AV stream 3 in Fig. 42 (c) comprises two audio streams for 
the video stream. In this case audio stream 1 is a single monaural 

20 channel while audio stream 2 contains two channels. The beginning of 
this audio stream 2 is recorded in stereo and then switches to dual 
monaural audio content. More specifically, audio stream 2 in this example 
comprises at least two of the following three audio content areas: a 
second (stereo) area containing first and second simultaneously 

25 reproduced audio channel data; a first (dual monaural) area containing 
first and second audio channels of which only one is selected and 
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reproduced; and a third (monaural) area containing only one audio 
channel. 

It will also be obvious that audio content is not limited to 
these stereo, dual monaural, and monaural types, and this audio stream 2 
5 is simply illustrative of an audio stream containing a mix of different audio 
types. In the example shown in Fig. 42 (c), audio stream 2 contains a 
stereo and a dual monaural area. Exemplary stereo content might be the 
commercials in a television broadcast while the dual monaural content 
contains separate audio streams of a bilingual broadcast in. for example, 

10 Japanese and English. 

As noted above the relationship between video and audio 
streams on DVD-RAM and other disc media is flexible with the audio 
stream configuration being easily adapted according to the application 
and objective of the plural AV streams recorded to any same disc. It 

15 should be noted here that the AV stream configuration shown in Fig. 42 
mimics a tape track configuration simply for ease of illustration and 
understanding. The actual AV stream configuration is a multiplexed bit 
stream of video stream data and one or more audio streams as shown in 
the MPEG system stream In Fig. 39 (c). 

20 

PROBLEM TO BE SOLVED 

The object of the present invention is to provide a DVD 
recorder that solves the following problems hindering obtaining maximum 
performance from DVD-RAM media, a high capacity rewritable storage 
25 medium widely anticipated as the next generation in AV recording media. 

As described with reference to Fig. 42 (a) to (c), one or a 
plurality of audio streams can be freely recorded to DVD-RAM media with 
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each audio stream containing a variable number of channels. The user 
can tell the number of audio streams and the channel configuration 
thereof by reproducing the AV stream to which this audio stream is 
recorded, but cannot know this information by simply loading the disc into 
the DVD-RAM disc player 

An object of the present invention is therefore to provide a 
disc format whereby the user can know the audio stream configuration of 
a recorded AV stream before actually reproducing the AV stream. 

Our invention further makes it possible for the user to know 
the audio stream configuration for each of one or a plurality of AV 
streams recorded to a DVD-RAM disc when the disc is loaded into a DVD- 
RAM disc player. 

The greatest problem confronted by enabling a DVD recorder 
to record a flexible correlation between video streams and audio streams 
so that different audio stream configurations can be achieved for each AV 
stream on the disc is how to manage the data internally and how to 
present the data to the user. 

The internal data management technique must be able to 
manage a variety of audio stream configurations while also achieving 
recordihg, reproducing, and editing functions without introducing 
incompatibilities and content mismatches. 

A diverse range of audio stream configurations enables finite 
recording space (capacity) to be used effectively, and enables recording 
various AV streams according to particular user objectives, but at the 
same time can introduce confusion. In other words, when a user wants to 
play a particular AV stream, the user cannot select the appropriate audio 
stream and audio channel(s) if the user does not know what audio stream 
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configuration was recorded for the desired AV stream. For example, if 
both Japanese and English audio streams are recorded and both are 
selectable so that the user can listen in either language, the AV stream 
could be reproduced in the language not desired by the user as a result of 
how the disc player is set up. Depending upon the circumstances it is also 
possible for the user to manually reselect the desired audio stream when 
the user realizes that the undesired audio stream is being reproduced. 

There is therefore a need for a method of correctly selecting 
the desired audio stream and audio channel at the same time the desired 
AV stream is selected for reproduction. 

It is therefore an object of our invention to provide a 
rewritable disc, such as a DVD-RAM disc, formatted such that information 
about the audio stream and audio channel configuration can be 
appropriately presented to the user using management information 
recorded to the disc when a plurality of audio stream configurations is 
recorded for one video stream, and an audio stream to be reproduced can 
be automatically selected according to selection information defined by 
the user. 

SUMMARY OF THE INVENTION 

To achieve the above object, our invention relates to an 
optical disc for storing an AV stream containing a video stream and at 
least one audio stream, and management information for managing the 
AV stream. More particularly, our invention provides such an optical disc 
in which the audio stream recorded to the optical disc comprises an audio 
channel area containing first and second audio channels, one of which is 
selectively reproduced. In addition, the management information stores 
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preference information indicative of whether the first audio channel data 
or second audio channel data is preferentially reproduced when 
reproducing the audio channel area. 

Further preferably, the audio stream contains at least two of 
5 the following areas: a first area containing first audio channel data and 
second audio channel data, one of which is selectively reproduced; a 
second area containing simultaneously reproduced first audio channel 
data and second audio channel data; and a third area containing data for 
one audio channel. In this case, the management information additionally 

10 contains application information indicating that two or more different 
audio stream areas are contained in one audio stream. 

Our invention further provides a recording device for 
recording to an optical disc an AV stream containing a video stream and 
at least one audio stream, and management information for managing the 

15 AV stream. This recording device has an encoder (7804) for encoding an 
audio stream having an area containing first audio channel data and 
second audio channel data, one of which is selectively reproduced; a 
controller (7802) for generating management information containing 
preference information indicating whether the first audio channel data or 

20 second audio channel data is to be preferentially reproduced when the 
audio channel area is reproduced; and drive means (7807. 7808) for 
recording the encoded audio stream and management information to a 
data area of the optical disc. 

Further preferably, the encoder further encodes an audio 

25 stream containing at least two of the following areas: a first area 
containing first audio channel data and second audio channel data, one of 
which is selectively reproduced; a second area containing simultaneously 



reproduced first audio channel data and second audio channel data; and 
a third area containing data for one audio channel. The controller in this 
case generates management Information containing application 
information indicating that two or more different audio stream areas are 
contained in one audio stream. 

Our invention also provides a playback device for reproducing 
an AV stream containing a video stream and at least one audio stream, 
and management information for managing the AV stream, from an optical 
disc. This playback device has reading means (7807, 7808) for reading 
management information from a data area of the optical disc; extraction 
means (7802, Step #26) for extracting preference information contained in 
the management information; and a decoder (7806) for selecting and 
decoding audio channel data indicated by the preference information. The 
preference information indicates whether the first audio channel data or 
second audio channel data is to be preferentially reproduced when 
reproducing the area containing the first audio channel data or second 
audio channel data of which one is selectively reproduced. 

Further preferably, this playback device additionally has an 
output means (7805) for displaying the preferred audio channel 
determined from the preference information. 

Yet further preferably, the extraction means of this playback 
device also extracts application information indicating two or more areas 
of different type are mixed in a single audio stream; and the output means 
displays information indicating that a mixture of audio types is recorded to 
the AV stream. 

BRIEF DESCRIPTION OF THE DRAWINGS 
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These and other objects and features of the present invention 
will be readily understood from the following detailed description taken in 
conjunction with preferred embodiments thereof with reference to the 
accompanying drawings, in which like parts are designated by like 
reference numerals and in which: 

Fig. 1 shows the logical structure of a disc according to a 
preferred embodiment of the present invention; 

Fig. 2 shows the internal structure of an AV file for movies; 

Fig. 3 shows the internal structure of an AV file for still 

images; 

Fig. 4 shows the relationship between AV data and 
management information; 

Fig. 5 shows the structure of the RTR_VMG block; 

Fig. 6 shows the structure of the RTR_VMGI block; 

Fig. 7 shows the VERN and TM_ZONE format; 

Fig. 8 shows the structure of the PL_SRP block; 

Fig. 9 shows the PL_TY and PL_CREATE format; 

Fig. 10 shows the PTM format; 

Fig. 11 shows the S_VOB_ENTN format; 

Fig. 12 shows the structure of the M_AVFIT block; 

Fig. 13 shows the V_ATR and A_ATR format; 

Fig. 14 shows the SP_ATR and SP_PLT format for movies; 

Fig. 15 shows the structure of the M_AVFI block; 

Fig. 16 shows the structure of the M_VOBI block; 

Fig. 17 shows the VOB_TY format; 

Fig. 18 shows the structure of the TMAPI block; 

Fig. 19 shows the VOBU_ENT format; 
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Fig. 20 shows the structure of the S_AVFIT block; 

Fig. 21 shows the V^ATR and OA_ATRS_AA_STI format; 

Fig. 22 shows the SP_ATR and SP_PLT format for still 

images; 

5 Fig. 23 shows the structure of the S_AVFI block; 

Fig. 24 shows the structure of the S_VOB_ENT block; 

Fig. 25 shows the S_VOB_ENT_TY format; 

Fig. 26 shows the structure of the UD_PGCIT block; 

Fig. 27 shows the structure of the TXTDT_MG block; 
10 Fig. 28 shows the structure of the PGCI block; 

Fig. 29 shows the PG_TY format; 

Fig. 30 shows the structure of the CI block; 

Fig. 31 shows the C_TY format; 

Fig. 32 shows the structure of the C_EPI block; 
15 Fig. 33 shows the EP_TY1 format; 

Fig. 34 is a block diagram of a DVD recorder drive; 

Fig. 35 (a) shows the volume address space of a disc, and (b) 
shows the change In data accumulation in the track buffer; 

Fig. 36 shows the correlation between picture types in an 
20 MPEG video system stream; 

Fig. 37 shows the structure of an MPEG system stream; 

Fig. 38 is a block diagram of an MPEG system decoder 

(P_.STD); 

Fig. 39 (a) shows video data, (b) shows the change in data 
25 accumulation in the video buffer, (c) shows the MPEG system stream, and 
(d) shows the audio data; 

Fig. 40 is a block diagram of a DVD recorder; 
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Fig. 41 is used to describe the audio stream configuration in a 
conventional AV stream; 

Fig. 42 is used to describe audio stream configurations in an 
exemplary AV stream; 
5 Fig. 43 shows the relationship between management 

information stored to disc and a first exemplary audio stream 
configuration; 

Fig. 44 shows the relationship between management 
information stored to disc and a second exemplary audio stream 
10 configuration; 

Fig. 45 shows the relationship between management 
information stored to disc and a third exem.plary audio stream 
configuration; 

Fig. 46 is used to describe the management information 
15 structure written to optical disc; 

Fig. 47 is a flow chart of an operation for recording audio data 
management information to optical disc; 

Fig. 48 is a flow chart of an operation for presenting a 
program list indicating the content of a disc inserted to the disc player; 
20 Fig. 49 is a flow chart of an operation for reproducing a 

program recorded to a disc inserted to the disc player; and 

Fig. 50 shows an exemplary program list generated from a 
disc inserted to the disc player and presented for the user. 



25 DESCRIPTION OF THE PREFERRED EMBODIMENTS 
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A DVD recorder and DVD-RAM disc are described below as a 
preferred embodiment of the present invention with reference to the 
accompanying figures. 
Logical structure of DVD-RAM 
5 The logical structure of a DVD-RAM disc is described first 

below with reference to Fig. 1. Fig. 1 shows the physical sector address 
area of the disc, and the structure whereby data is recorded to the disc as 
part of a file system. 

The physical sector address area of the disc starts with a 

10 lead-in area to which a reference signal for servo stabilization, and an ID 
signal for differentiating DVD-RAM media from other media, are recorded. 
The user data area follows the lead-in area. Logically valid data is 
recorded to the user data area. A lead-out area ends the physical sector 
address area; a reference signal is also recorded here. 

15 File system management information, called volume 

information, is recorded at the beginning of the user data area. The file 
system is not directly related to the present invention, and description 
thereof is thus omitted below. It should be noted, however, that by using a 
file system, data recorded to the disc can be managed as files and a 

20 directory to the files as shown in Fig. 1. 

All data handled by the DVD recorder is filed under the 
DVD_RTR directory directly below the root directory as shown In Fig. 1. 

Files handled by a DVD recorder can be grouped into two 
broad categories: a management information file (RTR.IFO file) and one 

25 or more AV files (RTR^MOV.VRO file, RTR_STO.VRO file). 
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AV files are recorded as an RTR_MO\/.\/RO file recording 
moving picture content (referred to as video below), or an RTR_STO.VRO 
file recording still image data and simultaneously recorded audio data. 

Fig. 2 shows the file structure of an RTR_MOV.VRO file 
recording video content. As shown in Fig. 2, MPEG program streams 
(M_VOB (Movie Video Object)) are arranged in recording sequence in the 
RTR_MOV.VRO file. 

Each program stream (M_VOB) is built from a plurality of 
Video Object Units (VOBU), each with a video reproduction time of 0.4 
sec. to 1.0 sec. 

Each VOBU comprises a number of video packs (V_PCK), 
audio packs (A_PCK). and subpicture packs (SP_PCK); each pack is 2 KB. 

The video data in each VOBU further comprises one or more 
Group of Pictures (GOP). The GOP is the decoding unit for MPEG video, 
starts with an l-picture, and contains plural P- or B-pictures. 

Fig. 3 shows the structure of an RTR_STO.VRO file for 
recording still images and audio data. As shown in Fig. 3, an 
RTR_STO.VRO file contains S_VOB (Still Picture Video Objects), the 
MPEG program stream for still images, arranged in recording sequence. 

The greatest difference between an S_VOB and M_VOB is 
that an S_VOB records still image data instead of moving picture data, 
and the still image data (video part) is followed by the audio data (audio 
part) instead of multiplexing the video and audio. 

An S_VOB also contains one VOBU, which comprises a 
V_PCK, A_PCK. and SP_PCK. 

AV data and management information 
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The relationship between M_VOB, S_VOB, and management 
information is described next below with reference to Fig. 4. 

As described above, there are two types of AV data, M_VOB 
and S^VOB. Management information M^VOBI for each M_VOB is stored 
for each M_\/OB where the M_VOBI records attributes of the 
correspondrng M^VOB. Individually managing S_VOBs, however, would 
greatly increase the amount of management information. Management 
information S^VOGI is therefore used to manage a group S^VOG 
containing plural S_VOB units. This S_VOGI records attributes for the 
corresponding S_\/OB group. 

What is important to note here is that MPEG stream data does 
not have a linear correlation between time and data size. As noted above, 
the MPEG system stream is compressed using temporal correlation 
characteristics and variable length coding techniques (including variable 
bit rate coding) in order to achieve high compression efficiency. As a 
result there is not necessarily a direct correlation between time and data 
size (address). 

Therefore, an M^VOBI also contains a filter (TMAP) for 
converting time and address information, and an S_VOGI also contains a 
filter (S_VOB Entries) for converting a still Image number in an S_VOG 
group and address. 

Management information for the reproduction sequence is 
described next below. 

The reproduction sequence Is defined as a program chain 
(PGC) or sequence of cells describing all or part of a range of M_VOB or 
S^VOG blocks. 
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The reproduction sequence can be either of two types: an 
original PGC referring to all AV data on the disc, or a user-defined PGC 
defining a user-selected reproduction sequence of AV data on the disc. 
Note that a plurality of user-defined PGC can be recorded. 

The original PGC is also called a Program Set having a 
Program layer logically bundling a plurality of cells. 

A user-defined PGC Is also called a Play List. Unlike an 
original PGC, a Play List does not have a Program layer. 

Management Infor mation file 

The content of the management information file RTR.IFO is 
described next below with reference to Fig. 5 to Fig. .33. 
RTR_VMG (Fig. 5) 

The VR_MANGR.IFO file contains real-time recording video 
management information RTR_VMG. RTR_VMG comprises seven tables: 
RTR_VMGI, M_AVFIT, S_AVFIT, ORG_PGCI. UD_PGCIT, TXTDT_MG, 
and MNFIT. 

These seven tables are described in detail next below. 
RTR_VMGI (Fig. 6) 

Real-time recording video management information 
RTR_VMGI includes video management information table VMGI_MAT and 
play list search pointer table PL_SRPT. 
VMGLMAT (Fig. 6) 

The video management information management table 
VMGI_MAT stores the following information relating to the entire disc. 
The reproducing device and recording device, referred to as simply disc 
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player and recorder, respectively, below, first read this VMGLMAT to 
detect the overall structure of the disc. 

VMG_ID (video management identifier) 

Stores the identifier DVD_RTR_VMGO identifying the disc as 
storing video recording data. 

RTR_VMG_EA (RTR^VMG end address) 
Stores the RTR_VMG end address. 
VMGLEA (VMGI end address) 

Stores the VMGI end address. 
VERN (version number) 

Records the version number of the recording format of the 
stored video recording data according to the format shown in Fig. 7. 
TM_ZONE (time zone) 

Records the time zone used for all time information recorded 
to the disc. As shown in Fig. 7, the TM_ZONE stores a time zone stamp 
TZJTY indicating whether time information is based on Greenwich Mean 
Time or a regional time standard (such as Eastern Standard Time (EST) 
or Japan Standard Time (JST)). and a time zone offset TZ_OFFSET 
recording the time difference to Greenwich Mean Time. 
STILL_TM (still time) 

Stores the still time used for presenting still images without 

sound. 

CHRS (character set code for primary text display) 

Defines the character set code to use for primary text displays 
(described below). 

M^AVFIT^SA (M^AVFIT start address) 
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stores the start address of the movie AV file information table 
M_AVFIT. This start address is used in the seek operation for accessing 
the M_AVFIT table. 

S_AVFIT_SA (S_AVFIT start address) 

Stores the start address of the still image AV file information 
table S_AVFIT. This start address is used in the seek operation for 
accessing the S_AVFIT table. 

ORG_PGCI_SA (ORG_PGCI start address) 

Stores the start address of the original PGC information. This 
start address is used in the seek operation for accessing the original PGC. 
UD_PGCIT_SA (UD_PGCIT start address) 

Stores the start address of the user-defined PGC information 
table. This start address is used in the seek operation for accessing the 
user-defined PGC information table. 

TXTDT_MG_SA (TXTDT_MG start address) 

Stores the start address of the text data management 
information TXTDT_MG. This start address is used in the seek operation 
for accessing the text data management information TXTDT^MG. 
MNFIT_SA (MNFIT start address) 

Stores the start address of the management file information 
table MNFIT. This address is used in the seek operation for accessing the 
MNFIT table. 

PL_SRPT (play list search pointer table) (Fig. 8) 

The play list search pointer table PL_SRPT records play list 
search pointer table information PL_SRPTI and n play list search pointers 
PL SRP. 
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PL_SRPTI (play list search pointer table information) (Fig. 8) 

The play list search pointer table information PL_SRPTI 
records the following information for accessing a play list search pointer 
PL^SRP. 

PL_SRP_Ns (number of play list search pointers) 

Stores the number of play list search pointers PL_SRP. 

PL_SRPT_EA (PL^SRPT end address) 

Stores the end address of this play list search pointer table 

PL_SRPT. 

PL_SRP (play list search pointer) (Fig. 8) 

Records the following information for accessing the actual 
play list data, that is. the user-defined PGC. 
PL_TY (play list type) 

Stores one of the following values for identifying the play list 
type using the format shown in Fig. 9. 

0000b: video only 

0001b: still images only 

0010b: both video and still images 

001 1b: audio only 



PGCN (PGC number) 

Stores the PGC number for the associated play list. The PGC 
number is the recording sequence of PGC information in the UD_PGCIT 
described below. 

PL_CREATE_TM (play list creation date/time) 



26 



stores the date and time the play list was created according 
to the format shown in Fig. 9. 

PRM_TXTI (primary text information) 

Stores text information indicative of play list content. For 
example, if the play list is a television program, PRM_TXTI could record 
the name of the show. PRM_TXTI includes an ASCII code field, and a 
field for the character code set defined by the above-noted CHRS. 
IT_TXT_SRPN (IT_TXT_SRP number) 

If information indicative of the play list content is recorded as 
the optional IT_TXT block in addition to the above-noted primary text, the 
•T_TXT_SRP number is stored as a link to the IT_TXT recorded in 
TXTDT_MG. This IT__TXT_SRP number is the recording sequence in 
TXTDT^MG, described below. 

THM_PTRI (thumbnail pointer information) 

Stores thumbnail image information for the play list. 

THM^PTRI (Fig. 8) 

THM_PTRI stores the following information indicating a 
thumbnail image location. 
CN (cell number) 

Stores the cell number containing the thumbnail image. The 
cell number is the recording sequence of the cell information in the 
UD^PGCI for this play list. 

THM_PT (thumbnail image pointer) 

Stores the presentation time of the video frame used as the 
thumbnail image according to the PTM (presentation time) describing 
format as shown in Fig. 10 if the cell indicated by CN is a video cell. PTM 
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is written according to the reference time of the time stamp written in the 
MPEG program stream. 

Stores the still image VOB entry number of the still image 
used as the thumbnail image according to the S_VOB_ENTN describing 
format as shown in Fig. 1 1 if the cell indicated by CN is a still image cell. 

M^AVFIT (Fig. 12) 

The movie AV file information table M_AVFIT stores 
management information for the movie AV file RTR^MOV.VRO, and 
comprises M^AVFITI. M_VOB_STI, and M_AVFI. 

M_AVFITI (movie AV file information table information) (Fig. 12) 

Stores the following information for accessing M_VOB_STI 
and M_AVFI. 

M_AVFI_Ns (movie AV file information number) 

Indicates the number of following AVFI information fields. If 0, 
no AVFI is present; if 1. an AVFI is present. AVFI presence corresponds 
to the presence of movie AV file RTR_MOV.VRO. 
M_VOB_STLNs (M_VOB_STI number) 

Indicates the number of following M_VOB_STI fields. 
M^AVFIT^EA (M^AVFIT end address) 
Stores the M_AVFIT end address. 

M_VOB_STI (movie VOB stream information) (Fig. 12) 

Stores the following as movie VOB stream information. 
V_ATR (video attributes) 
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stores the following video attributes according to the format 
as shown in Fig. 13. 

Video compression mode 

Stores one of the following values indicating the video 
5 compression mode. 
00b: MPEG_1 
01b: MPEG_2 

TV system 

Stores one of the following values indicating the television 

10 system. 

00b: 525/60 (NTSC) 
01b: 625/50 (PAL) 
Aspect ratio 

Stores one of the following values indicating the aspect ratio. 

15 00b: 4x3 
01b: 16x9 

Iine21_switch_1 

Stores one of the following values indicating whether closed 
caption data for field 1 is contained in the video stream. 
20 lb: recorded 

Ob: not recorded 

Iine21_switch_2 

Stores one of the following values indicating whether closed 
caption data for field 2 is contained in the video stream. 
25 lb: recorded 

Ob: not recorded 

Video resolution 
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stores one of the following values indicating the video 

resolution. 

000b: 720x480 (NTSC), 720x576 (PAL) 
001b: 702x480 (NTSC). 702x576 (PAL) 
5 010b: 352x480 (NTSC). 352x576 (PAL) 
011b: 352x240 (NTSC), 352x288 (PAL) 
100b: 544x480 (NTSC), 544x576 (PAL) 
101b: 480x480 (NTSC). 480x576 (PAL) 
AST_Ns (audio stream number) 

Stores the number of audio streams recorded to the 
corresponding VOB. 

SPST_Ns (still picture stream number) 

Stores the number of still picture streams recorded to the 
corresponding VOB. 
15 A_ATRO (audio stream 0 attributes) 

Stores the following attributes for the audio recorded to audio 
stream 0 using the format as shown in Fig. 13. 
Audio coding mode 

Stores one of the following values indicating the audio 
20 compression method. 
000b: Dolby AC-3 

001b: MPEG audio without an extension stream 
010b: MPEG audio with an extension stream 
011b: linear PCM 
25 Preference flag 

Stores one of the following values indicating user preference 
information for the audio channel. 
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00b: not applicable 
01b: audio channel 1 
10b: audio channel 2 

For example, if audio channel 1 is in Japanese, audio channel 
2 is in English, and the user prefers to listen In English, this preference 
flag is set to 10b by the user. 
Application Flag 

Stores one of the following values indicating the audio 

application. 

00b: not applicable 

01b: plural audio channel configurations are mixed 
10b: enhancement channel included 

Note that a value of 01b indicating plural audio channel 
configurations are mixed means, for example, that two or more audio 
streams of monaural, stereo, or dual audio (such as in both Japanese and 
English) are recorded to the AV stream on separate time bases. 

The enhancement channel is an enhanced audio channel for 
the visually impaired. 

Quantlzation/DRC 

Stores one of the following values for identifying whether 
dynamic range control (DRC) information is present. 
00b: DRC not contained in MPEG stream 
01b: DRC contained in MPEG stream 

If LPCM is used, the following value is stored to Identify the 
quantization level. 
00b: 16 bit 
fs 



31 



The following value is stored to identify the sampling 

frequency. 
00b: 48 kHz 

Number of Audio channels 
5 Stores one of the following values indicating the number of 

audio channels. 
0000b: 1 channel (monaural) 
0001b: 2 channel (stereo) 
0010b: 3 channel 
10 0011b: 4 channel 
0100b: 5 channel 
0101b: 6 channel 
0110b: 7 channel 
0111b: 8 channel 
15 1001b: 2 channel (dual monaural) 

Dual monaural refers, for example, to a bilingual recording 
with main (e.g., Japanese) and sub (e.g., English) channels, both of 
which are monaural. 
Bitrate 

20 Stores one of the following values indicating the bitrate. 

0000 0001b: 64 kbps 

0000 0010b: 89 kbps 

0000 0011b: 96 kbps 

0000 0100b: 112 kbps 
25 0000 0101b: 128 kbps 

0000 0110b: 160 kbps 

0000 0111b: 192 kbps 
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0000 1000b: 224 kbps 

0000 1001b: 256 kbps 

0000 1010b: 320 kbps 

0000 1011b: 384 kbps 
5 0000 1100b: 448 kbps 

0000 1101b: 768 kbps 

0000 1110b: 1536 kbps 

What is important here is that if the corresponding audio 

stream is an MPEG audio stream with an extension stream, only the 
10 bitrate of the base stream, not including the extension stream, is recorded. 

This is because compression using a VLC technique is used for the 

extension stream, and the extension stream therefore cannot be defined 

using a fixed bitrate as above. 

A_ATR1 (audio stream 1 attributes) 
15 Stores the following attributes of audio stream 1 using the 

format as shown in Fig. 13. Note that these attributes are defined using 

the same fields used with A_ATRO and described above, and further 

description is thus omitted here. 

As shown in Fig. 43, if there are two audio streams (audio 
20 stream 1 and audio stream 2) for a single AV stream, A_ATRO is used for 

audio stream 1 management information, and A_ATR1 is used for audio 

stream 2 management information. Because A_ATRO and A_ATR1 are 

identical in structure, A_ATRO shown on the bottom in Fig. 13 is also 

applicable to A_ATR1. 
25 One possible application for two audio streams is to broadcast 

a baseball game, for example, with announcer commentary for one team 
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broadcast in stereo on audio stream 1, and the announcer commentary for 
the other team broadcast in stereo on audio stream 2. 

If there is only one audio stream, that is, audio stream 1 in 
this case as shown in Fig. 44. A_ATRO is used for the audio stream 1 
management information while A_ATR1 is left blank or as initialized. 

Furthermore, if audio streams 1 and 2 are both recorded for a 
single AV stream, A_ATRO is used for audio stream 1 management 
information, and A_ATR1 is used for audio stream 2 management 
information. By setting the preference flag to 10b in A_ATR1 as shown in 
Fig. 45, audio channel 2, that is, the subchannel, can be designated the 
preferred channel and selected with priority to audio channel 1. 
Furthermore, by setting the application flag to 01 b it is known that a 
plurality of audio channels are mixed. By further setting the number of 
audio channels to 1001b, it is known that two channel (dual monaural) 
audio is the preferred mode. Which is the preferred or representative 
mode when there are plural modes can be detected by. for example, 
comparing the total time of each mode and selecting the mode with the 
longest time, or sending a code in the broadcast signal indicating a 
preselected preferred mode. 

SP_ATR (subpicture attribute) 

Records the subpicture attribute information shown below 
according to the format as shown in Fig. 14. 

Application Flag 

Stores one of the following values indicating the application 

type. 

00b: not applicable 
01b: caption 
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10b: animation 

SP_PLT (subpicture color palette) 

Records the subpicture color palette information using the format 
shown in Fig. 14. 

5 

M_AVFI (Fig. 15) 

The movie AV file information M_AVFI comprises the following 
information for accessing a movie VOB: M_A\/FI_GI, M_VOBI_SRP, and 
M_VOBI. 

10 M_AVFI_GI (movie AV file general information) (Fig. 15) 

Stores the movie VOB information search pointer count 
M_VOBI_SRP_Ns. 

M_VOBI_SRP_Ns (movie VOB information search pointer number) 
Records the number of movie VOB information search 
15 pointers M_VOBI_SRP. 

M_VOBI_SRP (movie VOB information search pointer) (Fig. 15) 

Stores address information for accessing each M_VOBI. 
M_VOBI_SA (movie VOB information start address) 

Stores the M_VOBI start address used for a seek operation 
20 accessing the corresponding VOBI information. 

M_VOBI (movie VOB information) (Fig. 16) 

Stores the following movie VOB management information: 
M_VOB_GI, SMLI, AGAPI, TMAPI. and CP_MNGI. 
25 M_VOB_GI (general information) (Fig. 16) 

Records the following general information relating to a movie 

VOB. 
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VOB_TY (VOB type) 

Stores VOB attributes according to the format as shown in Fig. 

17. 

TE 

Stores one of the following values indicating the VOB status. 

Ob: normal 

1b: temporarily or partially deleted 
AO_STATUS 

Stores one of the following values indicating the status of 
audio stream 0. 
00b: original state 
01b: overwritten 
A1_STATUS 

Stores one of the following values indicating the status of 
audio stream 1. 
00b: original state 
01b: overwritten 

10b: dummy for additional audio content 
11b: additional audio content added 
APS 

Stores one of the following values indicating the analog copy 
prevention control signal state. 
00b: analog copy protection not enabled 
01b: type 1 
10b: type 2 
lib: type 3 

SML FLG 
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stores one of the following values indicating whether the VOB 
is to be seamlessly reproduced with the preceding VOB. 
Ob: seamless reproduction not possible 
1b: seamless reproduction possible 
A0_GAP_LOC 

Stores one of the following values indicating the presence of 
an audio reproduction gap in audio stream 0. and Identifying the VOBU to 
which the audio reproduction gap is multiplexed. 
00b: no audio reproduction gap recorded 
01b: audio reproduction gap multiplexed to first VOBU 
10b: audio reproduction gap multiplexed to second VOBU 
11b: audio reproduction gap multiplexed to third VOBU 
A1_GAP_L0C 

Stores one of the following values indicating the presence of 
an audio reproduction gap in audio stream 1, and identifying the VOBU to 
which the audio reproduction gap Is multiplexed. 
00b: no audio reproduction gap recorded 
01b: audio reproduction gap multiplexed to first VOBU 
10b: audio reproduction gap multiplexed to second VOBU 
lib: audio reproduction gap multiplexed to third VOBU 

VOB_REC_TM (VOB recording date/time) 

The date and time the VOB was recorded is stored In the 
same format used for PL_CREATE_TM shown In Fig. 9. What Is Important 
to note here Is that this indicates the date/time that the first video 
presentation frame of the VOB was recorded. If the first video frame is 
changed by editing or deletion, this VOB_REC_TM value must be updated. 
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It should be further noted that the date/time of recording can be displayed 
synchronized to the VOB presentation similarly to the way a date/time is 
displayed on the viewfinder of a video camcorder by simply adding the 
time elapsed in the VOB to the time stored as \/OB_REC_TM. 

VOB_REC_TM_SUB (VOB recording date/time difference 
information) 

This field is used to absorb error in a VOB_REC_TM field that 
has been updated because the first video frame in the VOB was changed 
by VOB editing or deletion. As shown in Fig. 9, VOB_REC_TM is only 
accurate to the second. This means that if the video was edited or deleted 
at the frame or field level (precision), the recording time cannot be 
expressed with sufficient accuracy using only VOB_REC_TM. This field is 
therefore used to adjust for any difference. 

M_VOB_STIN (M_VOB_STI number) 

Stores the M_VOB_STI number corresponding to the VOB. 
This M_VOB_STI number is the recording sequence in the above-noted 
M_VOB_STI table. 

VOB_V_S_PTM (VOB video start PTM) 

Stores the VOB presentation start time based on the same 
reference time as the time stamp of the video stream. 

VOB_V_E_PTM (VOB video end PTM) 

, Stores the VOB presentation end time based on the same 
reference time as the time stamp of the video stream. It should be noted 
that the time stamp of the stream indicates the presentation start time of 
the frame, but this VOB_V_E_PTM field records the presentation end time, 
that is, the sum of the start time plus the frame presentation period. 

SMLI (seamless information) (Fig. 16) 
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SMLI stores the following information required for seamless 
reproduction with the preceding VOB. Note that this field is only recorded 
when the above-noted SML^FLG is 1b. 
VOB_FIRST_SCR 

Stores the SCR of the first pack in the VOB. 
PREV_VOB_LAST_SCR 

Stores the SCR of the last pack in the previous VOB. 
AGAPI (audio gap information) (Fig. 16) 

AGAPI records the following information required for the 
decoder to process an audio reproduction gap. This field is only recorded 
when a value other than 00b is written to the above-noted A0_GAP_LOC 
or A1_GAP_LOC. 

VOB_A_STP_PTM (VOB audio stop PTM) 

Records the time of the audio reproduction gap, that is, the 
time at which the decoder is to temporarily stop audio reproduction. This 
time is recorded using the same reference time as the stream time stamp. 
VOB_A_GAP_LEN (VOB audio gap length) 

Records the length of the audio reproduction gap in 90 kHz 

precision. 

CP_MNGI (copy management information) (Fig. 16) 

Records the copy management information for the 
corresponding VOB, and comprises CPG_STATUS and CPGI. 
CPG_STATUS (copy protection status) 

Stores a value used for VOB copy protection. CPG_STATUS 
indicates whether content can be freely copied or whether only a first 
generation copy can be made. 
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CPGI (copy protection information) 

Records the copy protection information applied to the 
corresponding VOB. 

TMAPI (time map information) (Fig. 18) 

The time map information comprises TMAP_GI. TM_ENT, and 
VOBU_ENT fields. 

TMAP^GI (Fig. 18) 

The general TMAP information TMAP_GI comprises 
TM^ENT^Ns, VOBU_ENT_Ns, TM_OFS, and ADR^OFS fields as 
described below. 

TM_ENT_Ns (TM„ENT number) 

Records the number of TM_ENT fields in the TMAPI block as 
described below. 

VOBU_ENT_Ns (VOBU^ENT number) 

Records the number of VOBU_ENT fields in the TMAPI block 
as described below. 

TM_OFS (time offset) 

Records the time map offset with the video field precision. 
ADR_OFS (address offset) 

Records the offset in the first AV field in the VOB. 
TM_ENT (time entry) (Fig. 18) 

A time entry cornprises the following fields as access point 
information at a constant time interval TMU. If the video format is NTSC, 
the TMU is 600 video fields; if PAL. it is 500 video fields. 
VOBU^ENTN (VOBU^ENT number) 

Records the entry number of a VOBU containing the time 
(TMU X (N-1) + TM^OFS for the N-th TM.ENT) indicated by the TM_ENT 
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TM_DIFF (time difference) 

Records the difference between the time indicated by this 
TI\/1_ENT and the presentation start time of the VOBU pointed to by 
VOBU_ENTN. 

VOBU_ADR (VOBU address) 

Records the start address in the VOB of the VOBU pointed to 
by VOBU_ENTN. 

VOBU_ENT (Fig. 19) 

The VOBU entry (VOBU_ENT) has the fields shown below for 
the corresponding VOBU. The fields are formatted as shown in Fig. 19. 
The time and address information required to access a desired VOBU can 
be obtained by simply adding the following fields in sequence. 
1STREF_SZ 

Stores the number of packs from the first pack in the VOBU to 
the pack containing the last data block of the first l-plcture in the VOBU. 
VOBU_PB_TM 

Records the playback time of this VOBU. 
VOBU_SZ 

Records the data size of this VOBU. 

S_AVFIT (Fig. 20) 

The still image AV file information table comprises the 
following management information fields for the still image AV file 
RTR_STO.VRO: S_AVFITI. S_VOB_STI. S_AVFI. 

S_AVFITI (still image AV file information table information) (Fig. 20) 
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stores the following information required to access 
S_VOB_STI and S_AVFI. 

S_AVFI_Ns (still image AV file information number) 

This is a value of either 0 or 1. This value corresponds to the 
5 number of still image AV files, that is, RTR_STO.VRO file presence. 

S_VOB_STI_Ns (still image VOB stream information number) 

Records the number of S_VOB_STI described below. 
S_AVFI_EA (still image AV file information end address) 
Records the S_AVFI end address. 
10 S_VOB_STI (still image VOB stream information) (Fig. 20) 

Records the following still image VOB stream information. 
V_ATR (video attributes) 

Information recorded as the video attributes are the Video 
compression mode, TV system, Aspect ratio, and Video resolution. These 
15 fields are as described above with reference to the video attributes 
V_ATR of the M_VOB_STI. 

OA_ATR (audio stream attributes) 

The audio stream attribute fields are: Audio coding mode. 
Application Flag, Quantization/DRC, fs. Number of Audio channels. These 
20 are also as described above with reference to the A_ATRO fields of the 
M_VOB_STI. 

SP_ATR (subpicture attribures) 

The Application Flag is recorded for the subpicture attributes. 
This field is the same as SP_ATR described above with reference to 
25 M_VOB_STI. 

SP__PLT (subpicture color palette) 
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stores the color palette information for subpictures. The 
format is as described with reference to the SP_PLT of M_VOB_STI. 

S_AVFI (still image AV file information) (Fig. 23) 

Comprises the following fields required to access a still image 
VOG: S_AVFI_GI, S_VOGI_SRP. and S_VOGI. 
S_AVFI_GI (Fig. 23) 

General still image AV file information S_AVFI_GI records 
S_VOGI_SRP_Ns. 

S_VOGI_SRP_Ns (still image VOB group search pointer number) 

Records the number of S_VOGI_SRP fields described below. 
S_VOGI_SRP (still image VOB group information search pointer) 
(Fig. 23) 

Records S_VOGI_SA. 
S_VOGI_SA (still image VO^ group information start address) 
records the start address of this S_VOGI. 

S_VOGI (Fig. 23) 

The still image VOB group information S_VOGI comprises the 
following still image VOB management information fields: S_VOG_GI, 
S_VOB_ENT, CP_MNGI. 
S_VOG_GI (Fig. 23) 

General stilt image VOB group information S_VOG_GI records 
the following fields as general information relating to the still image VOB 
group. 

S_VOB_Ns (still image VOB number) 
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Records the number of still image VOBs in the stiii image VOB 

group. 

S_V0B_STIN (S_VOB_STI number) 

Records the S_VOB_STI number storing the still image VOB 
stream information. This S_\/OB_STI number is the recording sequence in 
the S_VOB_STI table. 

FIRST_VOB_REC_TM (first VOB recording date/time) 

Records the recording date/time information of the first still 
image VOB in the still image VOB group. 

LAST_VOB_REC_TM (last VOB recording date/time) 

Records the recording date/time information of the last still 
image VOB in the still image VOB group. 

S_VOB_SA (still image VOB group start address) 

Records the start address of the still image VOB group in the 
RTR_STO.VRO file. 

CP_MNGI (copy management information) 

Records copy management information relating to the 
corresponding still image VOB group. The fields thereof are the same as 
the above-described CP_MNGI for movie VOB information M_VOBI. 
S_VOB_ENT (Fig. 24) 

Still image VOB entries S_VOB_ENT are defined as either 
type A or type B as described below according to whether there is audio 
recorded for individual still image VOBs in the still image VOB group. 
S_VOB_ENT (Type A) (Fig. 24) 

Type A comprises the fields S_VOB_ENT_TY and 
V_PART_SZ, defined as follows. 

S_VOB_ENT_TY (still image VOB entry type) 
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. still image VOB type information is formatted as shown in Fig. 

25. 

MAP_TY 

Stores one of the following values for identifying type A or 

type B. 
00b: type A 
01b: type B 
TE 

Stores one of the following values indicating the status of the 
still image VOB. 
Ob: normal 

1b: temporarily or partially deleted 
SPST_,Ns 

Stores the number of subpicture streams in the stilt image 

VOB. 

V_PART_SZ (video part size) 

Stores the data size of the video part of the still image VOB. 
S_VOB_ENT (Type B) (Fig. 24) 

In addition to S_VOB_ENT_TY and V_PART_SZ fields, type B 
also has A_PART_SZ and A_PB_TM fields as defined below. 
S_VOB_ENT_TY (still image VOB entry type) 

Records the type of the still image VOB. These fields are as 
described above with reference to type A. 
V_PART_SZ (video part size) 

Stores the data size of the video part of the still image VOB. 
A_PART_SZ (audio part size) 

Stores the data size of the audio part of the still image VOB. 
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A_PB_TM (audio playback time) 

Stores the playback time (length) of the audio part of the still 

image VOB. 

UD^PGCIT (Fig. 26) 

The user-defined PGC information table comprises the 
following fields: UD_PGCITI. UD_PGCI_SRP. and UD_PGCI. 

UP_PGCITI (Fig. 26) 

The user-defined PGC information table information 
UD_PGCITI records the following fields constituting the user-defined PGC 
information table. 

UD_PGCI_SRP_Ns (user-defined PGC information search pointer 
number) 

Records the number of UD_PGCI_SRP fields. 
UD_PGCIT_EA (user-defined PGC information table end address) 

Records the UD_PGCIT end address. 
UD_PGCI_SRP (Fig. 26) 

The user-defined PGC information search pointer 
UD_PGCI_SRP records the UD_PGCI_SA field. 

UD_PGCI_SA (user-defined PGC information start address) 

Records the UD_PGCI start address. This address is used to 
seek and access the PGCI. 
UD_PGCI (Fig. 26) 

The detailed structure of the user-defined PGC information is 
described further below under the PGC information PGCI. 
ORG_PGCI (Fig. 5) 

The detailed structure of the original PGC information is 
described further below under the PGC information PGCI. 
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TXTDT^MG (Fig. 27) 

The text data management field TXTDT_MG comprises 
TXTDTl; IT^TXT^SRP, and IT_TXT fields as described below. ' 
TXTDTI (Fig. 27) 

Text data information TXTDTI comprises the following fields: 
CHRS, IT_TXT_SRP_Ns, TXTDT_MG_EA. 
CHRS (character set code) 

Records the character set code used for IT_TXT. 
IT_TXT_SRP_Ns (IT_TXT search pointer number) 
Records the number of IT_TXT_SRP fields. 
TXTDT_MG__EA (text data management end address) 

Records the end address of the TXTDT_MG block. 
IT_TXT_SRP (Fig. 27) 

The IT_TXT search pointer IT_TXT_SRP records the following 
information for accessing IT_TXT. 

IT_TXT_SA (IT^TXT start address) 

Records the IT^TXT start address. This address is used to 
seek and access the IT_TXT block. 
IT_TXT_SZ (IT^TXT size) 

Records the IT_TXT data size. A desired IT_TXT block can be 
read by reading this amount of data. 
IT_TXT (Fig. 27) 

IT_TXT comprises one or more sets of three fields: 
identification code IDCD. the text TXT corresponding to that ID code, and 
an end code TMCD defining the end of the set. If there is no TXT field for 
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an IDCD, the TXT field can be omitted and IDCD and TMCD recorded as 

one set. Valid IDCD values are defined as follow. 
Genre codes 

30h: movie - . ^ 

5 31 h: music 

32h: drama y 

33h: animation 

34h: sports 

35h: documentary 
10 36h: news 

37h: weather 

38h: educational 

39h: hobby 

3Ah: entertainment 
15 3Bh: performing arts (plays, opera) 

3Ch: shopping 

Input source codes 

60h: broadcasting station 

61h: camcorder 
20 62h: photograph 

63h: memo 

64h: other 

PGCI (Fig. 28) 

25 Original program chain information ORG_PGCI and user- 

defined program chain information UD_PGCI have a common data 
structure collectively referred to as program chain information PGCI. 
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PGGI comprises the following fields: PGC_GI (program chain general 
information), PGI (program information), CLSRP (cell information search 
pointer), and CI (cell information). 

PGC_GI (Fig. 28) - - . 

PGC^GI (PGC general information) comprises the fields 
PG_Ns (program number) and CLSRP^Ns (cell information search 
pointer number) as general information about the PGC. These fields are 
described further below. 

PG_Ns (program number) 

Records the number of programs in the PGC. If a user-defined 
PGC. this field is 0 because there is no program. 

CLSRP^Ns (CLSRP number) 

Records the number of cell information search pointers 
CLSRP, described below. 

PGI (Fig. 28) 

PGI (program information) comprises the following fields as 
described below: program type PG_TY, cell number C_Ns, primary text 
information PRM^TXTI, IT_TXT_SRPN, and THM^PTRL 
PG_TY (program type) 

Records the following Information formatted as shown in Fig. 

29. 

Protect (protected) 

Ob: normal 

1b: protected 
C_Ns (cell number) 

Records the cell number in the program. 
PRM_TXTI (primary text information) 
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Records the text information describing program content. For 
further details, see the above-noted PL_SRPT. 
IT_TXT_SRPN (IT_TXT_SRP number) 
' If IT_TXT containing program content information is recorded 

5 in addition to the primary text noted above, the IT_TXT_SRP number 
recorded in TXTDT_MG is stored to this field. 

THM^PTRI (thumbnail image pointer information) 

Records the thumbnail image information representing this 
program. Details about the THM_PTRI are identical to the above-noted 
10 THM^PTRI of PL^SRPT. 

CLSRP (Fig. 28) 

The cell information search pointer (CL$RP) records address 
information required for accessing this cell information. 
CI_SA (cell information start address) 
15 Records the start address of the cell information. The cell is 

accessed by seeking this address. 
CI (Fig. 30) 

CI (cell information) is one of two types: M_CI for movies, or 
S_CI for still image. 
20 M_CI (Fig. 30) 

M_CI (movie cell information) comprises the following fields: 
M_C_GI and M_C_EPI. 
M.C^GI (Fig. 30) 

M__C_GI (movie cell general information) contains the 
25 following basic information for each cell. 
C_TY (cell type) 
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Records the following information formatted as shown in Fig. 
31 for identifying movie cells and still image cells. 
C_TY1 

000b: movie cell 
5 001b: still image cell 

M_VOBI_SRPN (movie VOB information search pointer number) 

Records the search pointer number of the movie VOB 
information corresponding to this celL To access the stream data 
corresponding to this cell, it Is first necessary to access the movie VOB 
10 information search pointer number indicated by this field. 
C_EPI_Ns (cell entry point information number) 

Records the number of entry points in this cell. 
C_V_S_PTM (cell video start time) 

Records the playback start time of the cell using the format 
15 shown in Fig. 10. 

C_V_E_PTM (cell video end time) 

Records the playback end time of the cell using the format 
shown in Fig. 10. Used in conjunction with C_V_S_PTM to define the valid 
cell period within the corresponding VOB. 
20 M_C_EPI (Fig. 32) 

M_C_EPI (movie cell entry point information) is categorized 
as Type A or Type B based on the presence of primary text. 
M_C_EPI (Type A) (Fig. 32) 

M_C_EPI (Type A) contains the following information 
25 indicative of an entry point. 

EP_TY (entry point type) 
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Records the following information formatted as shown in Fig. 
33 for identifying the entry point type. 
EP_TY1 
00b: Type A 
5 01b: Type B 

EP_PTM (entry point time) 

Records the time at which the entry point is set according to 
the format as shown in Fig. 10. 
M_C_EPI (Type B) (Fig. 32) 
10 In addition to the same EP_TY and EP_PTM fields of Type A, 

M_C_EPI (Type B) has a PRM^TXTI field as described below. 
PRM^TXTI (primary text information) 

Records text information describing the content of the location 
indicated by the entry point. Details of this information are as described in 
15 the above-noted PL_SRPT. 
S_CI (Fig. 30) 

S_CI (still image cell information) comprises S_C_GI and 
S_C_EPI fields. 

S^C.GI (Fig. 30) 

20 S_C_GI (still image cell general information) contains the 

basic cell information described below. 
C_TY (cell type) 

Records information for identifying movie cells and still image 
cells. This cell type information is as described above with reference to a 
25 movie cell. 

S_VOGI_SRPN (still image VOB group information search pointer 
number) 
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Records the search pointer number of the still image VOB 
group information for the cell. To access the stream data corresponding 
to the cell, it is first necessary to access the still image VOB group 
information search pointer number indicated by this field. 
C_EPI_Ns (cell entry point information number) 

Records the number of entry points in this cell. 
S_S_VOB_ENTN (starting still image VOB number) 

Records the still image VOB number from which cell 
reproduction starts according to the format as shown in Fig. 11. The still 
image VOB number is the sequence number in the S_VOG pointed to by 
the above-noted S^VOGLSRPN. 

E_S_VOB_ENTN (end still image VOB number) 

Records the still image VOB number at which cell 
reproduction ends according to the format as shown in Fig. 11. The still 
image VOB number is the sequence number in the S_VOG pointed to by 
the above-noted S_VOGLSRPN. It should be noted that the valid cell 
period in the S_VOG to which the cell belongs is defined by this field in 
conjunction with S_S_VOB_ENTN. 
S^C^EPI (Fig. 32) 

S_C_EPI (still image cell entry point information) is 
categorized as Type A or Type B depending upon the presence of primary 
text. 

S_C_EPI (Type A) (Fig. 32) 

S_C_EPI (Type A) contains the following information 
indicative of an entry point. 

EP_TY (entry point type) 



53 



Records the following information formatted as shown in Fig. 
33 for identifying the entry point type. 
EP_TY1 

00b: Type A - 
01b: Type B 

S^VOB^ENTN (still image VOB entry number) 

Records the still image number in which the entry point is set 
according to the format as shown in Fig. 11. 
S_C_EPI (Type B) (Fig. 32) 

In addition to the same EP_TY and S_VOB_ENTN fields of 
Type A, S_C_EPI (Type B) has a PRM_TXT1 as described below. 
PRM_TXTI (primary text information) 

Records text information describing the content of the location 
indicated by the entry point. Details of this information are as described in 
the above-noted PL_SRPT. 

Configuration of a DVD recorder 

The configuration of a DVD recorder is described next below 
with reference to Fig. 40. 

As shown in the figure, this DVD recorder comprises a user 
interface 7801 for interaction with the user; a system controller 7802 for 
handling overall management and control of the recorder; an input block 
7803 comprising an A/D converter for audio and video input to the 
recorder; an encoder 7804; an output section 7805 for audio and video 
output; a decoder 7806 for MPEG stream decoding; track buffer 7807; and 
drive 7808. 
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Operation of a DVD recorder 

The basic recording and reproduction operation of a DVD 
recorder shown in Fig. 40 is described next below. 

A recording operation is described first below. 
5 Before recording starts, the input block 7803, encoder 7804, 

and track buffer 7807 are initialized by a command from the system 
controller 7802. Audio and video data input to the input block 7803 are 
A/D converted and passed to the encoder 7804. The encoder 7804 
compresses and multiplexes the video and audio data to generate a 
10 MPEG system stream, which it then passes to the track buffer 7807. Data 
is then passed sequentially from the track buffer 7807 to the drive for 
recorded to DVD-RAM disc. 

A reproduction operation is described next. 
After inserting a DVD-RAM disc to the disc player for 
15 reproduction, the user selects the AV stream to be played from among the 
plural AV streams on the disc. The method whereby the user selects the 
desired AV stream is described in particular detail below. 

As noted above, a plurality of AV streams can be recorded to 
a single disc. It is therefore essential that the disc player be able to 
20 present to the user a list of alt AV streams currently on the disc. 

As also described above each AV stream is managed using 
video object VOB units. Video object information VOBI is recorded 
specifically for each VOB to record and manage attribute information for 
each VOB. A sequence of one or a plurality of VOBs is managed using a 
25 program chain PG defining a hierarchical sequence of VOBs. This 
program concept is introduced so that bitstreams representing a linear 
stream of data can be managed when recorded for whatever reason as a 
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plurality of AV streams. This program concept is useful for managing 
separately recorded scenes as a meaningful sequence of related events. 
For example, an AV stream of a tug-of-war recorded during morning using 
a disc camera, and another AV stream of a relay race recorded in the 
afternoon could be linked and managed as a single program titled as a 
"school field day." Program information PGI is recorded for each program. 
The title associated with a particular program is recorded and managed 
within the attribute information for that program. 

The program is thus a useful unit for recording and 
reproducing AV content in a way that can be easily understood by and 
presented to the user, while the video object VOB is the unit whereby 
these AV streams are internally managed by the DVD recorder. 

Management information as related to the present invention 
and applied to actual AV stream data is described next below with 
reference to Fig. 46. 

Row L1 in Fig. 46 shows program chain information PGCI. 
The left block L1a shows the original program chain information 
ORG_PGCI required to reproduce an AV stream as it was recorded. The 
right block Lib and L1c show the user-defined program chain information 
UD_PGCI required to reproduce a program chain as edited and defined by 
the user. 

The hierarchical structure of the ORG_PGCI management 
information is shown in Fig. 5, Fig. 28, and Fig, 30. The hierarchical 
structure of the UD_PGCI management information is shown in Fig. 5, 
Fig. 26. Fig. 28. and Fig. 30. 

It should be noted that plural UD_PGCI can be recorded to a 
single disc. The UD_PGCIT (user defined program chain information 
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table) as shown in Fig. 5 is therefore used as a table of these UD_PGCI 
from which a user can select a particular UD_PGCI for reproduction. 

The title of the first recorded program can be obtained by 
tracing a path through the management information in the order described 
below with reference to Fig. 46. 

S1 in Fig. 5 -> S2 S3 in Fig. 28. 

The second row L2 in Fig. 46 shows the still image VOB 
management information S^VOBI and movie VOB management 
information M_VOBI. Up to 999 M_VOBI management information blocks 
can be recorded to an optical disc. The hierarchical structure of this 
M_VOBI management information is shown in Fig. 5, Fig. 15, and Fig. 16. 

Whether a cell in the program chain information PGCI in row 
L1 is associated with any movie VOB management information M_VOBI in 
row L2 can be known from the management information described below. 

S1 in Fig. 5 -> S2 -> S3 in Fig. 28 (C_Ns is the number of 
cells in the program. By counting the number of cells contained in a 
program in sequence from the first program, the number of a cell 
contained in the desired program is obtained. The number of the obtained 
cell is used as the cell search pointer CLSRP#n.) 

-> S5 S6 S7 (Obtain the cell address based on the cell 
search pointer.) 

S8 (Obtain the number of the address cell information.) 
Fig. 30, S9 (movie cell information M_CI) 
S10 (movie cell general information M_CGI.) 

—> S11 (movie VOB information search pointer number 
M_VOBLSRPN) 

Fig. 5. S12 (AV file information table) 
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-> Fig. 15. S13 S14 -> S15 (access the movie VOB 
information search pointer detected in S11) 

— ► S16 -> S17 (determine the movie VOB information start 

address) 
5 _S18->S19 

Row L3 in Fig. 46 shows the movie VOB stream information 
M_VOB_STL Application Flag and Preference Flag according to the 
present invention are written to M_VOB_STI. That is. as shown in Fig. 12. 
M_VOB_STI contains audio attributes A_ATRO and A_ATR1. As shown in 

10 Fig. 13, audio attributes A_ATRO and A_ATR1 contain an Application Flag 
(b17, b16) and Preference Flag (b19, b18). A maximum 64 M_VOB_STI 
can be written to one optical disc. One M_VOB_STJ is allocated to each 
one M_VOBI, and M_VOB_STI of identical content are often allocated to 
plural M_VOBI. In this case a common M_VOB_STI can be linked to 

15 multiple M_VOBI. For example, M_V0BI#1 and M_V0BI#2 are linked to a 
common M_VOB_STI#2 in Fig. 46. 

The link between a particular M_VOBI and any particular 
M_VOB_STI can be known from management information as described 
next below. 

20 In the above sequence of steps it was determined in step S19, 

Fig. 5, what movie VOB management information M_VOBI is linked to 
what cells in the program chain information PGCI in Fig. 46. The 
M_VOB_STI linked to movie VOB management information M_VOBI is 
obtained as follows. 

25 Fig. 5. S19 — S20 Fig. 16. S21 (M_VOB_STIN indicates a 

stream information number of the movie VOB). 
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The Application Flag (b17, b16) and Preference Flag are 
obtained from this M^VOB^STIN as follows. 

Fig. 5, S12 -> Fig. 12, S22 S23 S24 

Fig. 47 is a flow chart of a process whereby a DVD-RAM disc 
5 is recorded by a DVD recorder according to the present invention. The 
steps in this process are described below. 

Step #1: The disc recorder/player shown in Fig. 40 is 
initialized. N_AC1 memory. N_AC2 memory. AP_FLG1 memory. AP_FLG2 
memory, and PR_FLG memory in the system controller 7802 are 

10 initialized. N_AC1 and N_AC2 are for temporarily recording the number of 
audio channels in audio stream 1 and audio stream 2. PR_FLG stores the 
user's channel preference. For example, if dual monaural recordings in 
Japanese and English are present, and the user prefers to listen in 
English, the Preference Flag information indicating that audio channel 2 

15 should be preferentially reproduced is stored temporarily to memory. 
AP_FLG1 and AP_FLG2 store the audio stream 1 and audio stream 2 
application information, that is, whether a mixed combination of plural 
audio channels is recorded. Information indicative of, for example, 
whether monaural, stereo, and dual monaural audio recordings are mixed 

20 in the single AV stream selected by the particular M_VOB_STI is 
temporarily stored in memory. A typical mixed audio application as shown 
in Fig. 42 (c) might be a movie broadcast in Japanese and English using 
dual monaural channels and commercials recorded in stereo. 

In Step #1 in this example, memory N_AC1 and N_AC2 are 

25 initialized to 1111b. and AP_FLG1 and AP_FLG2 are initialized to 00b. 
where "b" indicates binary code. PR_FLG is set to the preset value 
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selected by the user by means of user interface 7801, or to the default 
value if no user preference has been defined. 

Step #2: The input block 7803 sends the A/D converted input 
signal data (note that while both audio and video data is received and 
5 converted, we focus herein on the audio data) and the audio channel 
information extracted from the input signal to the encoder 7804. 

Step #3: The encoder 7804 encodes the audio data according 
to the audio data configuration, sends the encoded audio data and video 
data, that is, the encoded AV stream data, to the track buffer, and sends 
10 the audio channel information to the system controller 7802. 

Step #4: The system controller 7802 converts the received 
audio channel information for audio stream 1 or audio stream 2 to the 
above-noted 4-bit ID code, and buffers this data to N_AC1 or N„AC2. For 
example, if the ID code is 0001b, the audio channel information is known 
15 to be two channel (stereo). 

Step #5: If the audio channel for audio stream 1 or audio 
stream 2 differs within the same AV stream from the audio channel 
information previously received by the system controller 7802, AP_FLG1 
or AP_FLG2 is set to 01b. For example, if different audio channel 
20 configurations are mixed within a single audio stream as shown in Fig. 42 
(c), the Application Flag is set to 01b as noted above, indicating that the 
audio stream contains a mixed audio channel configuration. 

Step #6: The system controller 7802 controls the drive to 
sequentially read the AV stream data from the track buffer 7807 and 
25 record it to disc. 

Step #7: Whether there is more data to be input to the same 
AV stream is then detected. If there is, the procedure loops back to step 
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#2 and repeats. If not, it advances to step #8. In other words, step #8 
begins when recording one AV stream is completed. 

Step #8: Recording the movie VOB management information 
M_VOBI for the AV stream (VOB) that has been completely recorded 
5 begins to the RTR.FO file. It should be noted that most of this 
management information is temporarily stored in memory by the system 
controller 7802. 

Step #9: Information in N_AC1 memory about the number of 

audio channels stored, and information in AP_FLG1 memory indicative of 
10 whether a mixed audio channel configuration is recorded, is recorded to 

ATRO in stream information M^VOB_STI, that is, to areas b11, b10, b9. 

and b8. and b7 and b16 in the A_ATRO format shown on the bottom in Fig. 

13. Data from memory N_AC2 and AP_FLG2 is likewise recorded in ATR1 

of stream information M_VOB_STI. 
15 Step #10: User preference information recorded to PR_FLG 

memory is recorded to ATRO in stream information M_VOB_STI, that is, to 

areas b19. b18 in the A_ATRO format shown on the bottom in Fig. 13. 

Other information is also recorded to complete M_VOB_STI, and recorded 

to the RTR.IFO file. 

20 Step #11: Cell information CI corresponding to the recorded 

VOB and program information PGI are generated and appended to the 
end of the original program chain information ORG_PGCI. 

As shown in Fig. 50, when a DVD-RAM disc according to the 
present invention is loaded into a disc player, a list of ail programs (for 

25 example, television dramas, sitcoms, documentaries, new movies, other) 
recorded to the DVD-RAM disc is presented by means of the output 
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section 7805 for the user to review. The configuration of the audio 
accompanying each program is also indicated in this program list. 

Fig. 48 is a flow chart of a process for presenting a program 
list, and particularly information relating to the audio content configuration, 
for the user! The example described below addresses presenting a 
program list for an original program, that is, program content not edited 
by the user. The steps in this process follow. 

Step #20: A counter N for counting the number of programs is 
reset to zero. 

Step #21: Counter N is incremented 1. 

Step #22: Program information PGI for the N-th program, is 

read. 

Step #23: The title is read from the PRM^TXTI field of the PGI. 

Step #24: The cell information CI for the PGI is read to obtain 
the movie VOB information search pointer number M_VOBI_SRPN. 

Step #25: Using M_\/OBI_SRPN, the corresponding movie 
VOB management information M_VOBI is read and the movie VOB stream 
information number M_VOB_STIN is detected. 

Step #26: Using M_VOB_STIN. A_ATRO in the movie VOB 
stream information M_VOB_STI of the corresponding M_VOB is read, and 
the Application Flag. Preference Flag, and audio channel count for audio 
stream 1 are detected. A_ATR1 is likewise read, and management 
information is found therein the Application Flag, Preference Flag, and 
audio channel count for audio stream 2 are also detected. 

Step #27: Whether there is a next program information PGI is 
determined. If there is, the procedure loops back to step #21; if not, the 
procedure advances to step #28. 
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step #28: Using the title detected in step #23 and the 
Application Flag, Preference Flag, and audio channel count detected in 
step #26. a program list screen is generated and presented. This program 
list also displays based on the information from A_ATRO whether for 
5 audio stream 1 there is a mixed configuration of plural audio channels, 
whether an enhancement channel is present, and the number of audio 
channels. Based on the information from A_ATR1, it is also indicated for 
audio stream 2 whether there is a mixed configuration of plural audio 
channels, whether an enhancement channel is present, and the number of 
10 audio channels. If both A_ATRO and A_ATR1 are present, the program list 
indicates that there are two audio streams available; if only A_ATRO 
information is present, the program list indicates there is only one audio 
stream available. 

Step #29: Referring to the displayed program list, the user 
15 selects a program by way of user interface 7801. 

Step #30: The disc player then plays the selected program. 
The process for this is shown in the flow chart of 49 and described below. 

A typical program list display is described below with 
reference to Fig. 50. 
20 The numbers 1. 2, 3 on the left side of the display are 

sequential program numbers. The titles are shown in the middle column. 
In this example the program list includes a television drama, overseas 
documentary, and a new movie. This information is stored to the 
PRM_TXTI field of the program information PGI of the original program 
25 chain information ORG_PGCI written to the optical disc. 

The third column (including two audio streams, monaural, and 
stereo, for example) shows the audio stream configuration of the 
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corresponding program. This information relates to the audio stream 
contained in the VOB referenced from a cell contained in the 
corresponding program, and is generated and displayed using the ATRO 
or ATR1 information recorded to the M_VOB_STI specified by 
5 M_VOB_STIN in the VOBI of the VOB. The user knows from this 
information the configuration of the audio stream contained in the 
program, and can thus determine what audio streams can be selected and 
the typical format of an audio stream when reproduced. 

For example, it can be determined before playback begins 

10 that if the user selects the "overseas documentary", audio stream 1 of the 
dual monaural audio content will be reproduced based on the information 
recorded to ATRO, and that in accordance with the Preference Flag of 
ATRO storing the user's preference, audio channel 1 (main audio) will be 
reproduced by default. 

15 Likewise, if the "new movie" is selected, it can be known from 

the information recorded to the Number Of Audio Channels and the 
Application Flag recorded to ATRO that audio stream 1 contains a mixture 
of dual monaural and other audio content, and that the user's preference 
specified by the Preference Flag is to listen to audio channel 2 (sub) in 

20 the representative dual monaural parts of the audio content. 

It will thus be obvious that information from ATRO and ATR1 
can be presented to the user when allowing the user to select the 
program to be reproduced. The user can thus know basic information 
about the various possible audio stream configurations, such as how 

25 many audio streams are available, the channel configuration of each 
audio stream, and whether the channel configuration includes multiple 
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modes mixed together, before a selection is made and playback starts, 
thus avoiding confusing the user when playback actually begins. 

It is also possible to set for each VOB whether to select audio 
channel 1 or 2 when the audio stream is a dual monaural configuration, 
and the user can record using the Preference Flag of ATRO and ATR1 
which audio channel is preferred. 

Moreover, if the disc player has a function for automatically 
selecting and reproducing the audio channel specified by the Preference 
Flag of a particular VOB. the user can preprogram the disc player once 
with the user's preferred audio channel selection so that the desired 
audio channel will always be automatically selected and played without 
the user needing to change the audio channel at all. . 

The playback operation performed after the user selects the 
program to be reproduced is described next below. 

The track buffer 7807, decoder 7806, and output section 7805 
are initialized by a command from the system controller 7802. The system 
controller 7802 then instructs the disc drive to seek the start address of 
the AV data in the first VOB of the selected program. The drive then 
begins reading the data and passes the read AV data to the track buffer 
7807. The decoder 7806 reads data from the track buffer 7807, extracts 
the decompressed AV data by applying an MPEG decoding operation, and 
passes the decompressed AV data to the output section 7805. The output 
section 7805 D/A converts the data and outputs the resulting analog AV 
data to the television or other AV device connected to the output terminal. 
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Fig. 49 is a flow chart of a playback process for reproducing 
a program selected from the presented program list. The steps in this 
process are described next below. 

Step #40: The cell information CI for the program information 
PGI of the selected program is read to obtain M_VOB_SRPN. 

Step #41: The corresponding M_VOBI is read from the 
detected M_VOB_SRPN to obtain the VOB data read address. 

Step #42: M_VOB_STIN is obtained from M^VOBI. 

Step #43: Using M_VOB_STIN. the decoder is set to the 
attributes of the corresponding VOB stream. It should be noted that the 
audio stream attributes used to setup decoder operation at this time are 
the audio stream attributes related to the preseat invention, such as 
information about the number of audio channels and whether there is a 
mixed configuration of multiple audio channels. 

Step #44: If there are two audio streams, one is selected by 
means of the user interface of the disc player. If there are multiple audio 
channels in the selected audio stream, or in the audio stream itself if 
there is only one audio stream from the start, and user preference 
information is also recorded, the audio channel indicated by the user 
preference information is selected. If preference information is not 
recorded, a default audio channel, such as audio channel 1, is selected. 

Step #45: The decoder is set to the selected audio channel. 

Step #46: VOB data is read from the read address, buffered to 
the track buffer, and supplied to the decoder. 

Step #47: The decoder decodes the audio channel selected in 

step #44. 
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step #48: Whether there is further VOB data to be read is 
determined. If there is, the procedure loops back tostep #40; if not, the 
process ends. 

It should be noted that the preferred embodiment of the 
invention is described above with reference to a DVD-RAM disc, but it will 
be obvious to one with ordinary skill in the related art that any rewritable 
optical disc can be alternatively used. Also, the invention is not restricted 
to optical disc, but can be applied to a hard disk or even distributable 
flash memory, i.e., a rewritable, random accessible storage medium of 
sufficient capacity. 

In addition, preference information regarding two audio 
channels of a dual monaural configuration are used as the Preference 
Flag values stored to ATR0/ATR1 above, but preference information 
defining a preferred selection from among plural audio streams can be 
alternatively used. 

Yet further, the Application Flag information stored to 
ATR0/ATR1 as described above indicates whether a mixed configuration 
of plural audio channels is recorded, or whether an audio enhancement 
channel for the visually impaired is recorded. Other possible Application 
Flag values include information indicating whether audio in a particular 
language is recorded. The DVD recorder as referred to above is also 
assumed to have both recording and playback capabilities in a single unit, 
but the benefits of the present invention can also be achieved using a 
dedicated disc recorder and a separate dedicated disc player. 

Benefits of the invention 
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By recording information indicating the type of an audio 
stream configuration contained in AV data recorded to optical disc, 
meaningful information enabling the user to select a desired AV stream 
for reproduction can be presented to the user, thereby avoiding user 
5 confusion and making the optical disc and disc player easier to operate 
and use. 

Furthermore, by recording user preference information, it is 
additionally possible to provide a disc player capable of automatically 
selecting and reproducing an appropriate audio channel conforming to the 

10 user preference. 

Although the present invention has been described in 
connection with the preferred embodiments thereof with reference to the 
accompanying drawings, it is to be noted that various changes and 
modifications will be apparent to those skilled in the art. Such changes 

15 and modifications are to be understood as included within the scope of 
the present invention as defined by the appended claims, unless they 
depart therefrom. 
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